This paper studies the problem of detecting the information source in anetwork in which the spread of information follows the popularSusceptible-Infected-Recovered (SIR) model. We assume all nodes in the networkare in the susceptible state initially except the information source which isin the infected state. Susceptible nodes may then be infected by infectednodes, and infected nodes may recover and will not be infected again afterrecovery. Given a snapshot of the network, from which we know all infectednodes but cannot distinguish susceptible nodes and recovered nodes, the problemis to find the information source based on the snapshot and the networktopology. We develop a sample path based approach where the estimator of theinformation source is chosen to be the root node associated with the samplepath that most likely leads to the observed snapshot. We prove forinfinite-trees, the estimator is a node that minimizes the maximum distance tothe infected nodes. A reverse-infection algorithm is proposed to find such anestimator in general graphs. We prove that for $g$-regular trees such that$gq>1,$ where $g$ is the node degree and $q$ is the infection probability, theestimator is within a constant distance from the actual source with a highprobability, independent of the number of infected nodes and the time thesnapshot is taken. Our simulation results show that for tree networks, theestimator produced by the reverse-infection algorithm is closer to the actualsource than the one identified by the closeness centrality heuristic. We thenfurther evaluate the performance of the reverse infection algorithm on severalreal world networks.
展开▼
机译:本文研究了在信息传播遵循流行的敏感感染恢复(SIR)模型的网络中检测信息源的问题。我们假定网络中的所有节点最初都处于易受感染状态,但处于感染状态的信息源除外。然后,易感节点可能会被受感染的节点感染,并且受感染的节点可能会恢复,并且在恢复后不会再次被感染。给定网络快照,我们可以从中了解所有受感染的节点,但无法区分易受感染的节点和已恢复的节点,问题在于根据快照和网络拓扑查找信息源。我们开发了一种基于样本路径的方法,其中选择信息源的估算器作为与样本路径相关联的根节点,该根节点最有可能导致观察到的快照。我们证明了对于无限树,估计器是一个节点,该节点使与感染节点的最大距离最小。提出了一种反向感染算法来在一般图中找到这种估计量。我们证明,对于$ g $的常规树,使得$ gq> 1,$(其中$ g $是节点度,而$ q $是感染概率),估计量与实际源相距恒定距离,且概率很高,独立感染节点的数量和快照的时间。我们的仿真结果表明,对于树状网络,逆向感染算法产生的估计量比接近度中心性启发式算法所确定的估计量更接近实际来源。然后,我们进一步评估反向感染算法在几个现实世界网络上的性能。
展开▼